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ABSTRACT 

This  paper  presents  a  hierarchical  architecture  for  the 
coordination  and  control  of  autonomous  agents 
performing  intelligent  team  operations.  Each  team, 
consisting  of  multiple  aerial  and  ground  vehicles, 
uses  a  coordinated  strategy  through  communication 
via  a  wireless  network.  As  an  exemplary  case  study,  a 
pursuit-evasion  scenario  is  developed.  This  paper 
also  introduces  the  experimental  setup  for  aerial  and 
ground-based  autonomous  agents.  The  proposed 
scheme  is  currently  under  development  for  near 
future  experiments. 


1.  Introduction 

Intelligent  multi-agent  systems  have  been  of  great 
interest  recently  because  they  offer  rich  sets  of 
challenging  questions  to  address:  optimization, 
coordination,  fault  detection-tolerance,  stability,  and 
communication  among  them.  The  BErkeley  AeRobot 
(BEAR)  project  at  University  of  California  at 
Berkeley  started  as  development  of  a  single 
autonomous  Unmanned  Aerial  Vehicle  (UAV)  and 
now  one  of  current  areas  of  the  research  activities  is 
the  creation  of  an  intelligent  network  of  ground  and 
aerial  vehicles  performing  coordinated  operations. 
The  goal  of  this  research  is  the  organization  of 
multiple  autonomous  agents  into  integrated  and 
intelligent  systems  with  reduced  control  and 
cognition  complexity,  fault-tolerance,  adaptivity  to 
changes  in  task  and  environment,  modularity  and 
scalability  to  perform  complex  missions  efficiently. 
This  project  encompasses  diverse  active  research 
topics:  1)  multi-agent  coordination,  2)  hybrid  system 
synthesis  and  verification,  3)  communication,  4) 


navigation  and  5)  vehicular  system  identification  and 
control  synthesis. 

Among  many  scenarios,  the  pursuit-evasion  or 
search-rescue  mission  is  particularly  attractive  since 
it  addresses  most  of  the  interesting  issues 
aforementioned.  One  team  of  agents  play  pursuers  or 
rescuers  trying  to  catch  a  team  of  evading  agents, 
which  move  around  in  random  or  intelligent  manner, 
or  locate  the  objects  of  interest  while  minimizing 
some  cost  function. 


Figure  1,  Berkeley  testbed  for  pursuit-evasion  game 

In  our  implementation,  UAVs  are  pursuers,  which 
build  a  probabilistic  map  of  the  field  using  a  vision 
system  in  real-time  and  assume  a  flight  profile  to 
capture  the  evaders.  In  doing  so,  UAVs  are  required 
to  fly  through  the  given  waypoints  or  hover  over 
certain  points.  Rotorcraft-based  UAVs  (RUAVs)  are 
very  attractive  for  this  application  because  of  their 
maneuverability.  Unmanned  ground  vehicles  (UGVs) 
play  the  role  of  evaders,  traveling  on  the  surface  of 
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the  arena,  following  a  certain  motion  rule.  Depending 
on  the  scenario,  some  UGVs  can  act  as  pursuers  in  a 
cooperative  network  with  UAVs, 

Wireless  communication  serves  as  the  backbone  to 
connect  individual  agents  for  exchanging  necessary 
information.  While  the  current  setup  assumes  a  fixed 
number  of  communication  nodes  throughout  the 
mission,  more  advanced  wireless  network 
architectures  using  dynamic  clustering  are  currently 
under  development. 

In  this  paper,  we  introduce  the  research  activities  for 
the  development  of  multi-agent  coordination  systems, 
emphasizing  the  system  architecture  and  realization 
of  pursuit-evasion  or  search-rescue  scenarios. 


2.  BEAR  System  Architecture 

Our  architecture  of  multi-agent  distributed  system  is 
inspired  by  hierarchical  hybrid  systems  [7],  This  will 
allow  integration  and  synchronization  of  global  plans 
from  local  intentions  of  each  agent  in  the  perspective 
of  multiple-UAV/UGV  operations.  Each  UAV 
consists  of  the  base  airframe  and  the  integrated 
avionics  systems,  which  includes  path  planning  and 
flight  control  system,  navigational  sensors,  and  a 
communication  module  (Figure  2),  UGVs  are 
similarly  constructed.  Strategy  planning  and 
probabilistic  map  building  of  the  environment  may  be 
done  on-board  or  in  ground  station  depending  on  the 
choice  of  informational  centralization, 

2.1  Strategy  Planning 

The  strategic  planner  is  responsible  for  overall 
planning  for  the  execution  of  mission.  It  decomposes 
a  mission  into  a  sequence  of  waypoints.  In  addition,  it 
acknowledges  the  completion  of  a  subtask,  such  as 
arrival  at  a  waypoint,  and  schedules  the  next  one. 
When  the  tactical  layer  notifies  this  layer  of  the 
potential  conflict,  it  will  generate  proper  constraints 
for  conflict-free  trajectories.  In  multi-agent  systems, 
this  layer  allocates  resource  needed  to  accomplish  the 
mission  efficiently. 

In  our  pursuit-evasion  or  search-rescue  type 
scenarios,  the  proper  policy  is  employed  to  generate 
waypoints  for  each  agent  in  this  layer.  Using  the 
collected  information  by  the  map  builder,  the  strategy 
planner  calculates  the  tactical  movements  of  the 
pursuing  agents  at  next  time  frame  and  distributes 
them  through  the  wireless  communication  network. 


Figure  2,  BEAR  System  Architecture 


2.2  Path  Planning  and  Regulation  Layer 

This  layer,  composed  of  the  trajectory  planner  and 
vehicle  regulation,  resides  on  each  agent  to  generate  a 
realizable  path  for  each  agent  to  follow  and  control 
the  host  vehicle  to  track  the  given  path. 

The  trajectory  planner  designs  a  realizable  trajectory 
for  each  agent  and  associated  flight  modes,  based  on 
a  detailed  dynamic  model  of  the  RUAV  and  the 
trajectoiy  from  tactical  planner.  Different  flight 
modes  such  as  take-off,  hover,  cruise,  turn,  etc.  may 
lead  to  multiple  sets  of  control  laws.  Moreover,  the 
resulting  trajectory  is  given  to  the  regulation  layer  to 
directly  control  the  dynamics  of  each  agent.  Thus,  the 
transfer  between  controllers  is  desired  to  be  bumpless 
and  the  issue  of  actuator  saturation  should  be 
considered  in  generating  trajectory  constraints. 

The  motion  request  by  Strategy  planner  is  transmitted 
to  this  layer  via  wireless  communication  network. 
The  motion  request  is  cast  in  the  form  of  Vehicle 
Control  Language  (VCL),  a  human-understandable 
script  language  proposed  by  Shim[10],  This  structure 
delivers  the  navigation  information  including 
coordinates  of  target  waypoint,  type  of  waypoints  and 
other  requirements.  Based  on  the  contents  of  YCL, 
PPR  determines  the  feasible  flight  mode,  generate 
reference  trajectory  in  realtime  and  feed  it  to  the 
integrated  regulation  layer. 

Regulation  layer  plays  the  important  role  to  stabilize 
the  inherently  unstable  dynamics  of  the  host  vehicle 
and  track  the  given  trajectory.  The  underlying 
feedback  controller  is  currently  based  on  classical 
multi-loop  SISO  controllers  as  shown  in  Figure  3. 
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The  feedback  compensation  gains  are  determined 
applying  classical  controller  design  framework  to  the 
LTI  model  for  hover  [10], 


Figure  3.  The  proposed  controller  architecture  using 
SISO  multi-loop  controllers 


23  Vision  Systems  of  RUAV 

The  onboard  vision  system  of  RUAV  consists  of 
color  CCD  camera,  color  tracking  devices  and 
dedicated  vision  processing  computer.  The  current 
vision  processing  system  uses  color  segmentation  to 
identify  objects  from  camera  images.  In  this  scheme, 
the  ground  agents  are  marked  with  certain  color  of 
high  contrast  with  background.  The  vision  computer 
computes  the  localized  coordinate  of  the  target  and 
then  converts  it  into  the  global  coordinates  using  the 
position  and  attitude  information  estimated  by  the 
onboard  vision  system.  The  estimated  positions  of  the 
evaders  are  then  reported  to  the  map  builder  via 
wireless  communication  network, 

2.4  Map  Building 

Based  on  sensory  information,  this  layer  dynamically 
builds  a  representation  of  features  of  the  environment 
that  is  relevant  for  the  navigation  of  agents.  This 
map,  which  contains  observation  and  possible 
positions  of  objects  of  interest,  will  be  sent  to  the 
strategy  planner  and  used  as  a  basis  for  planning  and 
performing  of  tasks.  Depending  on  the  scenarios  and 
the  computational  load  of  the  onboard  computers, 
this  is  done  either  on  each  agent  or  centrally  by  our 
ground  station. 

2.5  Communication  Network 

The  multi-agent  scenario  requires  communication 
channel  among  the  participating  agents.  Since  the 
participating  agents  are  moving  in  the  arena  freely, 
wireless  communication  is  preferred  unquestionably. 
The  scenario  that  the  agents  in  each  group  exchange 
the  position  information  requires  peer-to-peer 


communication  setup  than  one-to-one  format,  which 
is  typical  setup  for  serial  communication.  In  this 
research,  Lucent  Technology’s  Orinoco  system  is 
chosen  as  the  communication  backbone,  Orinoco 
system  supports  TCP/UDP/IP  in  user-selectable 
speeds  from  2Mbps  to  11Mbps  in  2.4GHz  band. 

In  our  small-sized  test  field,  we  notice  no  packet  loss, 
but  can  expect  imperfect  communication  between 
agents  in  a  real  scenario.  One  of  the  active  areas  of 
research  within  the  group  is  on  how  to  share 
information  and  coordinate  actions  with  an  unreliable 
communication  channel. 


2.7  RUAV  Platform 

An  RUAV  should  be  able  to  maneuver  through  the 
given  waypoints  while  searching  for  ground  agents 
using  vision  processor.  Small-size  helicopters  are 
chosen  for  the  aerial  agents  because  of  their  flexible 
maneuverability  such  as  vertical  take-off/landing, 
hovering,  side-step  flight  and  forward  flight.  The 
capability  of  hovering  and  low  velocity  forward/ 
lateral  maneuver  is  very  valuable  when  they  need  to 
track  ground-based  agents.  They  can  be  also  operated 
in  a  relatively  small  area  because  they  do  not  need  a 
runway  to  take  off  and  land. 

Four  different  sizes  of  helicopters  are  considered  for 
RUAV  application:  Yamaha  industrial  helicopter 
RMAX  and  R-50,  Bergen  Industrial  Twin,  and 
Kyosho  Concept  60.  Among  these,  Yamaha 
helicopters  are  used  as  the  aerial  agents  for  PEG 
because  of  their  sufficient  payload  and  reliability. 
The  aerial  agents  are  equipped  with  autopilot  system 
and  vision  processing  computer.  The  autopilot  system 
is  divided  into  navigation  sensor  suite  and  the  flight 
computer.  The  navigation  sensor  suite  consists  of 
inertial  navigation  system  (INS),  global  positioning 
system  (GPS),  and  ultrasonic  height  sensors.  For 
INS,  Boeing  DQI-NP  is  chosen  for  the  built-in  data 
processing  capability,  compact  size  and  GPS- 
integrability  option.  NovAtel  GPS  MillehRT-2 
provides  the  position  information  updated  at  4HZ 
with  approximately  2  cm.  Ultrasonic  sensors  provide 
the  local  altitude  information  valuable  for 
autonomous  take-off  and  landing  stage. 

Flight  control  computer  (FCC)  is  Intel  Pentium-based 
IBM-PC  compatible  system  in  PC  104  industrial 
standard.  FCC  is  in  charge  of  sensor  management, 
control  command  generation  and  wireless 
communication.  The  onboard  flight  control  software 
is  running  on  QNX  realtime  operating  system. 
Control  outputs  for  four  channels,  main  rotor 
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collective  pitch,  tail  rotor  collective  pitch,  main  rotor 
longitudinal  and  lateral  cyclic  pitch,  are  computed 
using  a  programmed  control  law  for  stabilization  and 
tracking  of  the  host  vehicle.  FCC  also  downloads  the 
current  flight  status  using  Orinoco  wireless  LAN  at 
2.4GHz. 

2.8  Ground  Robots 

Since  the  experimental  set-up  requires  the  UGYs  to 
be  operated  outdoors  to  internet  with  RUAVs, 
Aetivmedia  Pioneer  2-ATs  were  chosen.  These 
rugged  UGVs  are  four  wheel  drive,  differential  skid¬ 
steering  robots  designed  for  all-terrain  operations. 

For  self-localization  or  position  estimation  of  robots, 
the  usual  approach  is  based  on  a  combination  of  dead 
reckoning  with  periodical  compensation  using 
external  information  to  keep  the  accuracy  of  position 
from  gradually  decaying.  This  external  information  is 
obtained  from  active/passive  landmarks  or  from  the 
matching  between  a  global  map  and  the  information 
provided  by  agents.  However,  we  are  mainly 
interested  in  operations  in  which  a  priori 
environmental  information  is  unavailable,  so  GPS  is 
chosen  as  the  primary  navigation  sensor.  Other 
components  for  sensing  and  navigation  include 
position  encoders,  digital  compass,  and  range-finding 
ultrasonic  sonar  transducers. 

Currently  the  UGVs  are  programmed  for  navigation 
in  an  outdoor  environment  using  Saphira  motion 
control  software.  The  strategy  planner  can  access  the 
integrated  onboard  PC  through  RS232.  Then  the 
Saphira  OS  accepts  the  motion  command  from  the 
upper  layer  and  steers  the  host  vehicle  to  the  desired 
position  by  transmitting  appropriate  motor  commands 
to  the  robot.  The  robot  has  two  independent  control 
channels  for  transition  and  rotation  and  commands  to 
control  them  can  be  issued  and  executed 
concurrently. 

Shaphira  also  has  several  functions  to  look  as  the  raw 
sonar  readings  and  determine  if  an  obstacle  is  near 
the  robot.  These  detection  functions  either  look  at  a 
rectangular  region  in  the  vicinity  of  the  robot  or  a 
portion  of  a  half-plane. 


3.  Multi-player  Pursuit-Evasion  Game 

To  validate  the  proposed  architecture,  a  particular 
kind  of  game  scenario  is  conceived  in  which  a  group 
of  pursuers  are  attempting  to  capture  another  group 
of  evaders  within  a  fixed  and  unknown  arena  which 


may  contain  fixed  obstacles.  The  discrete-time  game 
is  implemented  algorithmically  on  a  discrete  map 
over  which  the  pursuers  assign  a  probability  of 
evader  occupation.  The  pursuers  use  their 
observations  at  each  time  instant  i  e  T  =  {l, 2,...}  to 

update  the  perceived  state  of  the  arena  and  then 
predict  the  state  of  the  arena,  particularly  the  location 
of  the  evaders,  at  the  next  time  instant. 

We  denote  by  y  (t)  the  set  of  measurements  taken  by 
the  pursuers  at  each  time  ie  T .  Every  y(/)  is 

assumed  to  be  a  random  variable  with  values  in  a 
measurement  space  V .  Each  control  action 

u is  a  function  of  Yt  ={y(l),...,y(t)},  the 
sequence  of  measurements  taken  up  to  time  t,  By  the 
pursuit  policy  we  mean  the  function  g;  Jf*  -» t/ 
such  that 

ufr+l)  =  g(Yt) 

for  each  te  where  y*  is  the  set  of  all  finite 
sequences  of  if .  We  say  that  an  evader  was  found  at 
time  te  when  one  of  the  pursuers  is  located  at  a 
cell  for  which  the  conditional  probability  of  the 
evader  being  there,  given  Yt,  exceeds  a  certain 

threshold.  T°  represents  the  first  time  instant  at 
which  one  of  the  evaders  is  found.  E-  j^T°  J ,  the 

expected  value  of  T°  under  a  specific  pursuit  policy 
g :  J/*  Y/  provides  a  good  measure  of  the 
performance  of  g .  However,  since  the  dependence 
of  2sj[r]  on  g  is  very  complex,  finding  an 

optimal  policy  that  minimizes  is  difficult 

and  not  suited  for  real-time  applications.  In  this 
research,  we  will  concentrate  on  suboptimal  policies 
and  compare  the  performance  of  different  strategies 
with  three  pursuers  and  varying  number  of  evaders. 

Suppose  that  np  pursuers  try  to  find  a  single  evader. 
Then,  at  each  te  T ,  the  position  of  the  pursuers 
(oJe^T^and  the  position  of 

the  evader  xf  (t)  e  /¥  can  be  considered  as  random 
variables.  If  a  model  for  the  motion  of  the  evader  is 
assumed  to  be  known,  for  Vxe  X ,  Yt{i)e  3f* , 
pe(x,Y,)  =  P1(xe(t  +  l)  =  x\Yt  =  Yt) 

can  be  computed  recursively  as  a  deterministic 
function  of  the  last  measurement  y(t )  in  Y, 
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and pjx.y^,),  where  Y,_t  is  the  first  r-lelements  of 
Yt.  This  conditional  probability  will  be  used  to 
generate  pursuit  policy  u(t) ,  as  to  be  explained  later. 

Once  waypoints  for  movement  are  planned  as  u(t) ,  a 
realizable  trajectory  is  generated  by  incorporating  the 
continuous-time  dynamic  vehicle  model  and 
regulation  layer.  The  game  terminates  when  the 
pursuers  capture  all  the  evaders  in  the  arena. 

Among  the  many  development  tools  used  to  simulate 
hybrid  behavior,  two  different  simulation 
environments  are  adopted  in  this  research:  Hybrid 
System  Tool  Interchange  Format  (SHIFT)  and 
MatLAB/Simulink.  Although  SHIFT  offers  a  number 
of  features  ideal  for  the  simulation  of  a  dynamically 
evolving  hybrid  network,  MatLAB/Simulink  is  also 
used  in  this  research  because  of  its  fast  computation 
and  convenient  graphical  user  interface.  Figure  4 
shows  all  components  of  the  system  described  in 
Section  2  built  for  simulation. 


Figure  4.  Hierarchical  Structure  Implemented  in 
Matlab/Simulink 


the  evader 


Figure  5  shows  the  trail  of  three  pursuers  in  100m  x 
100m  arena  until  they  capture  an  evader,  when  the 
capture  is  defined  as  collocation  of  a  pursuer  and  an 
evader.  Figure  6  shows  the  display  environment  for 
the  visualization  purpose  of  our  scenarios. 


Figure  6.  Simulation  Display 


An  agent  following  an  efficient  strategy  would  be 
expected  to  perform  thorough  local  searches,  but  at 
the  same  time  adapt  on  a  global  scale  to  information 
supplied  by  other  agents.  A  classical  search  strategy 
is  an  A*-type  search,  whereby  each  agent  moves 
towards  the  global  location  which  has  the  highest 
discounted  probability  of  being  occupied  by  an 
evader.  The  discounting  factor  is  designed  to  be 
proportional  to  the  distance  from  the  agent  to  each 
location.  When  xp{t)  denotes  die  position  of  pursuers 


at  time  t,  this  strategy  can  be  expressed  as 

/  ft,  w 

±p.M 

8a=  argmin  d  }, argmax  - r 

JE®'  . wnr  1) 


.where  d  is  the  distance  function  according  to  the 
movement  of  pursuers  and  Z/(x(o)c2/  is  the  set  of 
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cells  reachable  from  x(0  in  a  single  step,  i.e.  for 
every  x={xx,.„,xn}e  , 

V{w, . 

Another  obvious  candidate  for  a  search  policy  is  just 
a  simple  greedy  strategy,  whereby  each  agent  moves 
to  the  cell  within  its  range  that  has  the  highest 
probability  of  containing  an  evader  at  the  next 
instant,  i.e,, 

88=  argmax  Z^Pe(vk’Y)- 

{n,...,vnp]ei/(x(,))k=l 

The  former  policy  gA  leads  to  relatively  poor 

performance  as  the  agents  do  not  make  complete 
searches  of  the  map,  but  rather  traverse  the  map  back 
and  forth  frequently.  Moreover,  without  the  proper 
coordination,  pursuers  tend  to  move  toward  the  same 
place,  losing  the  advantage  of  having  multiple  agents 
covering  a  wider  region  (Figure  7). 

In  [8],  it  was  shown  that  the  expected  time  needed  to 
find  the  evader  is  finite  under  greedy  strategy  gB .  In 

fact,  a  simple  greedy  strategy  does  have  good 
performance  and  computationally  efficient  as  its 
computational  cost  depends  only  on  t/Ut))  rather 
than  the  entire  <lf .  The  drawback  to  the  pure  greedy 
policy,  though,  is  that  (even  in  the  perfect 
communication  case)  each  agent  effectively  works  on 
its  own  and  does  not  take  advantage  of  information 
gathered  by  other  agents.  Considering  that  our 
interest  is  in  the  hierarchical  and  team- wise  approach 
to  the  pursuit-evasion  problem,  a  pure  greedy 
strategy  is  not  very  appealing  either  (Figure  8). 

One  combined  policy  found  to  be  especially  effective 
is  where  agents  in  general  follow  a  greedy  strategy, 
but  (a  subset  of  the  agents)  are  ’dispatched’,  i.e, 
follow  a  trajectory,  to  locations  where  enemies  are 
thought  to  have  been  seen.  This  policy  utilizes  the 
thorough  local  search  provided  by  a  greedy  strategy, 
yet  still  allows  agents  to  react  to  Information 
provided  by  other  agents  (Figure  9). 

One  of  the  extensions  to  the  standard  pursuit-evasion 
game  we  have  considered  is  the  inclusion  of  a 
supervisory  agent  [9],  Ibis  supervisory  agent  may 
have  more  accurate  sensors  and  a  longer  range  of 
vision,  but  cannot  capture  the  evaders.  These 
supervisory  agents  can  be  thought  of  as  roughly 
analogous  to  satellites  or  Airborne  Warning  and 
Control  Systems  (AWACS)  platforms  in  the  modem 


battlefield.  These  supervisory  agents  traverse  the  map 
in  a  pre-defined  manner  and  report  enemy  sightings. 
If  an  agent  is  following  a  pure  greedy  policy,  though, 
he  will  effectively  ignore  these  updates  and  no 
improvement  in  performance  is  noticed  when  a 
supervisor  is  added  to  agents  using  pure  greedy 
strategies.  The  greedy/dispatch  strategy  mentioned  in 
the  previous  paragraph  lends  itself  to  a  scenario  with 
a  supervisor  quite  naturally,  and  an  improvement  in 
performance  Is  noted  versus  the  case  without  a 
supervisor  (Figure  7-9).  Figures  7-10  show  the  data 
all  averaged  from  100  runs.  In  Figures  7-9,  five  left 
columns  have  no  AWACS  while  the  five  right 
columns  have  one  AWACS. 

Figure  10  shows  a  comparison  of  the  three 
aforementioned  policies  when  ‘intelligence’  is  added 
to  the  evaders.  In  an  attempt  to  add  more  realism,  the 
evaders  were  designed  to  hide  so  as  not  to  be  seen  by 
the  supervisory  agents.  Once  an  evader  was  spotted 
by  a  supervisory  agent,  it  would  hide  near  an  obstacle 
until  the  supervisor  had  moved  Itself  out  of  range. 
The  initial  spotting  of  the  evader  was  sufficient, 
though,  to  dispatch  a  pursuer  to  the  area,  and  in  this 
scenario,  our  combined  policy  shows  its  true 
advantage  over  the  pure  greedy  policy. 


number  of  enemies 


■  Enem  y5 
ElEnemy4 
DEnemyS 
E8  Enem  y2 
□  Enem  yl 


Figure  7.  A*  Policy 


number  of  enemle* 


■  EnemyS 
E3£nemy4 
□  Enemy3 
BEficmy2 
OEnemyl 


Figure  8.  Greedy  Policy 
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4.  Conclusion 


number  ol  enemies 


■  EnemyS 
OEnemy4 
□  Enemy3 
BEnemy2 
SEnemyl 


Figure  9.  Combined  Policy 


Figure  10,  Comparison  of  the  three  policies  for  three 
random(R)  and  intelligent(l)  evaders 


Our  research  has  focused  primarily  on  the 
coordination  of  the  movement  of  agents,  but  recently 
an  emphasis  has  been  placed  on  the  coordination  of 
information  as  well.  The  previous  simulations  all 
assumed  perfect  communication  between  all  agents, 
i.e.  each  agent  would  update  its  own  map  with  the 
sensor  reading  of  all  the  agents  and  thus  all  agents 
effectively  shared  one  complete  map.  A  much  more 
realistic  scenario  is  the  situation  of  imperfect 
communication  between  agents.  Communication  may 
be  disrupted  or  corrupted  for  numerous  reasons 
including  propagation  delays,  the  physical  location  of 
agents,  loss  of  an  agent,  or  jamming.  The  design  of  a 
network  and  communication  structure  that  will  be 
robust  to  errors  and  loss  of  communication  between 
agents  is  of  extreme  interest.  A  strategy  without 
communication  between  agents  would  intuitively  be 
expected  to  give  poor  performance  and  this  has 
indeed  been  confirmed  by  simulation.  Various 
protocols  for  information  sharing  as  well  as  flexible 
network  structures  are  being  explored  to  facilitate  the 
creation  of  robust  communications.  The  movement 
strategies  will  also  be  tuned  to  correspond  to  the 
reliability  of  information  shared  between  agents. 


This  paper  formulates  a  setup  for  the  development  of 
pursuit-evasion  games  using  multiple  RUAVs  and 
UGVs.  The  development  of  control  and  coordination 
algorithms  and  their  test  in  simulation  have  been 
completed  and  the  development  of  the  physical 
testbeds  is  as  well  nearing  completion.  Hie  future 
emphasis  of  this  research  will  be  on  the  investigation 
of  more  robust  methodologies  for  complex  and 
unreliable  configurations.  The  implementation  of  the 
algorithms  and  software  on  the  UAVs  and  UGVs 
using  the  outlined  methodology  will  be  followed  by 
the  completion  of  an  actual  field  exercise. 
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